Search results for "DNA sequences"

showing 5 items of 5 documents

Variable Ranking Feature Selection for the Identification of Nucleosome Related Sequences

2018

Several recent works have shown that K-mer sequence representation of a DNA sequence can be used for classification or identification of nucleosome positioning related sequences. This representation can be computationally expensive when k grows, making the complexity in spaces of exponential dimension. This issue effects significantly the classification task computed by a general machine learning algorithm used for the purpose of sequence classification. In this paper, we investigate the advantage offered by the so-called Variable Ranking Feature Selection method to select the most informative k − mers associated to a set of DNA sequences, for the final purpose of nucleosome/linker classifi…

0301 basic medicineSequenceSettore INF/01 - InformaticaEpigenomic030102 biochemistry & molecular biologybusiness.industryComputer scienceDeep learningPattern recognitionFeature selectionDNA sequencesNucleosomesRanking (information retrieval)Set (abstract data type)03 medical and health sciencesVariable (computer science)030104 developmental biologyDimension (vector space)Feature selectionDeep learning modelsArtificial intelligenceDeep learning models Feature selection DNA sequences Epigenomic NucleosomesRepresentation (mathematics)business
researchProduct

AnABlast: Re-searching for Protein-Coding Sequences in Genomic Regions

2019

AnABlast is a computational tool that highlights protein-coding regions within intergenic and intronic DNA sequences which escape detection by standard gene prediction algorithms. DNA sequences with small protein-coding genes or exons, complex intron-containing genes, or degenerated DNA fragments are efficiently targeted by AnABlast. Furthermore, this algorithm is particularly useful in detecting protein-coding sequences with nonsignificant homologs to sequences in databases. AnABlast can be executed online at http://www.bioinfocabd.upo.es/anablast/ .

Fossil DNA sequencesProtein coding0303 health sciencesGene predictionCoding DNA sequences030302 biochemistry & molecular biologyComputational biologyBiologyGene findingDNA sequencing03 medical and health sciencesExonchemistry.chemical_compoundIntergenic regionchemistryHomologous chromosomeSmall genesGeneIn silico annotation toolDNA030304 developmental biology
researchProduct

Aporocotyle mariachristinae n. sp., and A. ymakara VillalbaFernández, 1986 (Digenea: Aporocotylidae) of the pink cusk-eel, Genypterus blacodes (Ophid…

2012

Aporocotyle mariachristinae n. sp. and A. ymakara Villalba & Fernández, 1986 were collected from the bulbus arteriosus and ventral aorta of pink cusk-eels, Genypterus blacodes (Forster, 1801) from Patagonia, Argentina. A. mariachristinae n. sp. can be distinguished from all the species of Aporocotyle by the asymmetrical extension of posterior caeca (right posterior caecum longer, terminating at the area between mid-level of ovary and posterior body end; left posterior caecum shorter, terminating at the area between mid-level of cirrus sac and posterior to reproductive organs), the distribution of spines along the ventro-lateral body margins and the number of testes. The new species clearly …

RDNA SEQUENCESADN ribosomiqueOphidiiformesA. MARIACHRISTINAE N. SP.Aporocotyle//purl.org/becyt/ford/1 [https]A. YMAKARAFish DiseasesAPOROCOTYLIDAERNA Ribosomal 28SPatagoniaOPHIDIIDAEPatagonieAortaARGENTINAEelsbiologyHeartAnatomyGenypterus blacodesOriginal ContributionA. mariachristinae n. sp.PATAGONIAInfectious DiseasesGenypterus blacodesSÉQUENÇAGETrematodaTrematodaArgentinePATAGONIECIENCIAS NATURALES Y EXACTASGENYPTERUS BLACODESVeterinary (miscellaneous)Otras Ciencias BiológicasArgentinaBulbus arteriosusTrematode InfectionsDNA RibosomalDigeneaMerlucciuslcsh:Infectious and parasitic diseasesCaecumCiencias BiológicasOphidiidaeséquençageRNA Ribosomal 18SAnimalslcsh:RC109-216//purl.org/becyt/ford/1.6 [https]ARGENTINErDNA sequencesAPOROCOTYLEA. ymakarabiology.organism_classificationAporocotyleADN RIBOSOMIQUEAporocotylidaeInsect ScienceAnimal Science and ZoologyParasitologyParasite (Paris, France)
researchProduct

Molecular markers indicate the phylogenetic identity of southern Brazilian sea asparagus: first record of Salicornia neei in Brazil

2019

Abstract Molecular phylogenetic analyses based on ETS, ITS and atpB - rbcL spacer sequences assessed the phylogenetic status of the southern Brazil sea asparagus species of the genus Salicornia (Salicornioideae, Amaranthaceae). Accessions of Patos Lagoon estuary (32° S) were obtained from wild plants and two pure line lineages, selected from contrasting prostrate (BTH1) and decumbent (BTH2) ecomorphotypes found locally. Patos Lagoon wild plants, BTH1 and BTH2 f4 progenies showed 100% identical sequences for the atpB - rbcL and ITS spacers, only two mutations for ETS. Comparison of the sequences of these three markers with GenBank records confirmed the identity of Brazilian accessions as Sal…

SalicorniaDNA sequencesPlant ScienceHorticultureBiologysalt marshesGenusSarcocornialcsh:BotanyBotanyAsparagusCladelcsh:QH301-705.5SalicornioideaePhylogenetic treeAmaranthaceaebiology.organism_classificationlcsh:QK1-989lcsh:Biology (General)marismashalophyteGenBanksequências de DNAhalófitasSalicornioideae
researchProduct

Normalised compression distance and evolutionary distance of genomic sequences: comparison of clustering results

2009

Genomic sequences are usually compared using evolutionary distance, a procedure that implies the alignment of the sequences. Alignment of long sequences is a time consuming procedure and the obtained dissimilarity results is not a metric. Recently, the normalised compression distance was introduced as a method to calculate the distance between two generic digital objects and it seems a suitable way to compare genomic strings. In this paper, the clustering and the non-linear mapping obtained using the evolutionary distance and the compression distance are compared, in order to understand if the two distances sets are similar.

Settore ING-INF/05 - Sistemi Di Elaborazione Delle Informazionibusiness.industryCompression (functional analysis)Metric (mathematics)Normalized compression distanceuniversal similarity metric USM clustering DNA sequences normalised compression distance evolutionary distance genomic sequences nonlinear mapping bioinformaticsPattern recognitionArtificial intelligenceCluster analysisbusinessDistance matrices in phylogenyMathematics
researchProduct